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DETAILED ACTION 

Response to Amendment 

1 . This communication is in response to the Amendment filed 5 May 2008. 

2. Claims 1, 2, 5-12 and 14-24 are currently pending and claims 3, 4, 13 and 25 
have been canceled. In the Amendment filed 5 May 2008, claims 1 and 14 are 
amended. This action is made Final. 

Information Disclosure Statement 

3. The information disclosure statement (IDS) submitted on 3 July 2008 was filed 
after the mailing date of the Non-Final Rejection on 4 February 2008. The submission 
is in compliance with the provisions of 37 CFR 1 .97. Accordingly, the information 
disclosure statement is being considered by the examiner. 

Claim Rejections - 35 USC § 103 

4. The following is a quotation of 35 U.S.C. 1 03(a) which forms the basis for all 
obviousness rejections set forth in this Office action: 

(a) A patent may not be obtained though the invention is not identically disclosed or described as set 
forth in section 102 of this title, if the differences between the subject matter sought to be patented and 
the prior art are such that the subject matter as a whole would have been obvious at the time the 
invention was made to a person having ordinary skill in the art to which said subject matter pertains. 
Patentability shall not be negatived by the manner in which the invention was made. 

5. This application currently names joint inventors. In considering patentability of 
the claims under 35 U.S.C. 103(a), the examiner presumes that the subject matter of 
the various claims was commonly owned at the time any inventions covered therein 
were made absent any evidence to the contrary. Applicant is advised of the obligation 



Application/Control Number: 10/796,644 Page 3 

Art Unit: 2167 

under 37 CFR 1 .56 to point out the inventor and invention dates of each claim that was 
not commonly owned at the time a later invention was made in order for the examiner to 
consider the applicability of 35 U.S.C. 103(c) and potential 35 U.S.C. 102(e), (f) or (g) 
prior art under 35 U.S.C. 103(a). 

6. Claims 1, 2, 5, 6, 9-12, 14-18 and 21-24 are rejected under 35 U.S.C. 103(a) 
as being unpatentable over US Patent No. 6,584,464 to Warthen (hereafter 
Warthen) in view of US PGPub 2004/0199498 to Kapur et al (hereafter Kapur) in 
view of US PGPub 2004/0260677 to Malpani et al (hereafter Malpani). 

Referring to claim 1, Warthen discloses a method of compressing a log of 
natural language data (see column 4, lines 65-66), the log having a plurality of natural 
language help query strings [questions] (see column 4, lines 38-41), each string being 
including at least two tokens [i.e., Where can I find information on the sport bicycling?] 
(see column 4, lines 32-36). Warthen also discloses the further limitation of wherein 
each string is a query relative to a help function of a computer system [the user asks 
questions of an information server; information server 50 may be built into a kiosk for 
providing answers to kiosk users' questions, or information server 50 might be 
connected to a corporate network for answering customer, supplier and/or customer 
questions] (see column 2, lines 48-50 and column 3, lines 57-67). However, Warthen 
fails to explicitly disclose the further limitations of the actual steps taken to compress the 
log. Kapur discloses receiving various query log files from various sources and then 
processing the logs (see [0035], lines 1-8), including the further limitations of: 
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applying a compression operation [canonicalized] to each string (see [0036], 
lines 3-5); 

identifying two strings that match each other after the compression operation 
[consolidate] (see Fig 5, step 510); and 

removing one of the two matching strings from the log to form a compressed log 
[multiple occurrences of the same query are included as a single query] (see [0035], 
lines 10-13). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to use the query processing engine disclosed by Kapur to compress the log of 
questions disclosed by Warthen. One would have been motivated to do so in order to 
produce a set of questions which improve the process of determining the context of a 
user query and then associating the most useful result with the query (Warthen: see 
column 1 , lines 43-51 ) in order to produce a set of questions which improve the process 
of determining the context of a user query and then associating the most useful result 
with the query. 

The combination of Warthen and Kapur (hereafter Warthen/Kapur) fails to 
explicitly disclose the further limitation of feeding the compressed log back to the 
compression operation at least once. However, it would have been obvious to one of 
ordinary skill in the art at the time of the invention to apply the steps of compressing a 
log to the compressed log. One would have been motivated to do so since this is a 
mere duplication of parts and the mere duplication of parts has no patentable 
significance unless a new and unexpected result is produced (see In re Harza, 274 F.2d 



Application/Control Number: 10/796,644 Page 5 

Art Unit: 2167 

669, 124 USPQ 378 (CCPA 1960)). In this instance, applying compression to a 
compressed log merely further compresses the log. 

Warthen/Kapur fails to explicitly disclose the further limitation of training a 
statistical process with the log. Wang discloses search category classification (see 
abstract), including the further limitation of training a statistical process [classification 
component 120 which implements a statistical model] with the log [training data source] 
(see [0030] and [0031]). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to utilize the compressed log of Warthen/Kapur to train a statistical 
process as disclosed by Malpani since the classification component of Malpani can be 
trained from one or more of a number of sources (Malpani: see [0021]). One would 
have been motivated to do so in order to increase search providers in understanding 
user intent and providing optimal search results and content to querying users (Kapur: 
see [0005] and [0006]). 

Referring to claim 2, the combination of Warthen/Kapur and Malpani (hereafter 
Warthen/Kapur/Malpani) discloses the method of claim 1 , wherein the log is a log of 
user-initiated inputs [users' questions] to a help interface [client interface] (Warthen: see 
column 3, lines 59-67). 

Referring to claim 5, Warthen/Kapur/Malpani discloses the method of claim 1, 
wherein the compression operation is character-based [removing odd symbols] (Kapur: 
see [0039]-[0048]). 
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Referring to claim 6, Warthen/Kapur/Malpani discloses the method of claim 1 , 
wherein the compression operation is token-based (Kapur: see [0036], lines 23-28). 

Referring to claim 9, Warthen/Kapur/Malpani discloses the method of claim 1 , 
and further comprising: 

applying a second compression operation [tokenized] to each string (Kapur: see 
[0036], lines 23-28); 

determining if any two strings match each other after the second compression 
operation [convergence] (Kapur: see [0038], lines 1-2); and 

removing one of the two matching strings from the log (Kapur: see [0038], lines 

5-6). 

Referring to claim 10, Warthen/Kapur/Malpani discloses the method of claim 9, 
wherein the first compression operation is character-based [canonicalize - item 500] 
and the second compression operation is token based [tokenize - item 520] (Kapur: see 
Fig 5). 

Referring to claim 11, Warthen/Kapur/Malpani discloses the method of claim 

10, and further comprising applying subsumption [Generation of Extensions, 
Associations and Alternatives - items 570, 580 and 590] after the second compression 
operation [tokenize] is complete (Kapur: see Fig 5). 

Referring to claim 12, Warthen/Kapur/Malpani discloses the method of claim 

1 1 , wherein the subsumption operation is repeated for the log [the log would be 
processed by the processing engine one more time] (Kapur: see [0035], lines 3-8). 
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Referring to claim 14, Warthen discloses a system for compressing a query log 
having a plurality of natural language help query strings [questions] (see column 4, lines 
38-41 and 65-66), each string having a plurality of tokens [i.e., Where can I find 
information on the sport bicycling?] (see column 4, lines 32-36). Warthen also discloses 
the further limitation of the log of natural language help query strings being relative to a 
help function of a computer [the user asks questions of an information server; 
information server 50 may be built into a kiosk for providing answers to kiosk users' 
questions, or information server 50 might be connected to a corporate network for 
answering customer, supplier and/or customer questions] (see column 2, lines 48-50 
and column 3, lines 57-67). However, Warthen fails to explicitly disclose the further 
limitations of the actual steps taken to compress the log. Kapur discloses receiving 
various query log files from various sources and then processing the logs (see [0035], 
lines 1-8), including the further limitations of: 

an input for receiving a raw query log (see [0035], lines 3-8); 

memory [memory or database file 310] for storing the raw query log (see [0035], 
lines 19-31); 

a processor [query processing engine 300] (see Fig 3) for applying at least one 
compression operation [canonicalized] to each string (see [0036], lines 3-5), and for 
scanning the modified strings to determine if any match each other [consolidate] (see 
Fig 5, step 510) so that one of the matching strings can be removed to form a 
compressed log (see [0035], lines 10-13); and 
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wherein the processor is configured to utilize the compressed query log once the 
removal is complete (see [0025]) in order to produce a set of questions which improve 
the process of determining the context of a user query and then associating the most 
useful result with the query. 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to use the query processing engine disclosed by Kapur to compress the log of 
questions disclosed by Warthen. One would have been motivated to do so in order to 
produce a set of questions which improve the process of determining the context of a 
user query and then associating the most useful result with the query (Warthen: see 
column 1, lines 43-51). 

Warthen/Kapur fails to explicitly disclose the further limitation of feeding the 
compressed log back to the compression operation at least once. However, it would 
have been obvious to one of ordinary skill in the art at the time of the invention to apply 
the steps of compressing a log to the compressed log. One would have been motivated 
to do so since this is a mere duplication of parts and the mere duplication of parts has 
no patentable significance unless a new and unexpected result is produced (see In re 
Harza, 274 F.2d 669, 124 USPQ 378 (CCPA 1960)). In this instance, applying 
compression to a compressed log merely further compresses the log. 

Warthen/Kapur fails to explicitly disclose the further limitation wherein the 
processor is configured to utilize the query log to train a statistical process. Wang 
discloses logging queries in a log database, including the further limitation wherein the 
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processor is configured to utilize the query log to train a statistical process [FAQ 
matcher] (see [0049]). 

Warthen/Kapur fails to explicitly disclose the further limitation of wherein the 
processor is configured to utilize the query log to train a statistical process. Wang 
discloses search category classification (see abstract), including the further limitation of 
wherein the processor is configured to utilize the query log [training data source] to train 
a statistical process [classification component 120 which implements a statistical model] 
(see [0030] and [0031]). 

It would have been obvious to one of ordinary skill in the art at the time the 
invention was made to utilize the query log of Warthen/Kapur to train a statistical 
process as disclosed by Malpani since the classification component of Malpani can be 
trained from one or more of a number of sources (Malpani: see [0021]). One would 
have been motivated to do so in order to increase search providers in understanding 
user intent and providing optimal search results and content to querying users (Kapur: 
see [0005] and [0006]). 

Referring to claim 15, Warthen/Kapur/Malpani discloses the system of claim 14, 
wherein each string is a query relative to a help function (Warthen: see column 3, lines 
59-67). 

Referring to claim 16, Warthen/Kapur/Malpani discloses the system of claim 15, 
wherein each help-related query is relative to a computer system [corporate network 
answering employee questions] (Warthen: see column 3, lines 59-67). 
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Referring to claim 17, Warthen/Kapur/Malpani discloses the system of claim 14, 
wherein the compression operation is character-based [removing odd symbols] (Kapur: 
see [0039]-[0048]). 

Referring to claim 18, Warthen/Kapur/Malpani discloses the system of claim 14, 
wherein the compression operation is token-based (Kapur: see [0036], lines 23-28). 

Referring to claim 21, Warthen/Kapur/Malpani discloses the system of claim 14, 
and further comprising: 

applying a second compression operation [tokenized] to each string (Kapur: see 
[0036], lines 23-28); 

determining if any two strings match each other after the second compression 
operation [convergence] (Kapur: see [0038], lines 1-2); and 

removing one of the two matching strings from the log (see [0038], lines 5-6). 

Referring to claim 22, Warthen/Kapur/Malpani discloses the system of claim 21 , 
wherein the first compression operation is character-based [canonicalize - item 500] 
and the second compression operation is token based [tokenize - item 520] (Kapur: see 
Fig 5). 

Referring to claim 23, Warthen/Kapur/Malpani discloses the system of claim 22, 
and further comprising applying subsumption [Generation of Extensions, Associations 
and Alternatives - items 570, 580 and 590] after the second compression operation 
[tokenize] is complete (Kapur: see Fig 5). 



Application/Control Number: 10/796,644 Page 1 1 

Art Unit: 2167 

Referring to claim 24, Warthen/Kapur/Malpani discloses the system of claim 23, 
wherein the subsumption operation is repeated for the log [the log would be processed 
by the processing engine one more time] (Kapur: see [0035], lines 3-8). 

7. Claims 7, 8, 19 and 20 are rejected under 35 U.S.C. 103(a) as being 
unpatentable over US Patent No. 6,584,464 to Warthen (hereafter Warthen) in view 
of US PGPub 2004/0199498 to Kapur et al (hereafter Kapur) in view of US PGPub 
2004/0260677 to Malpani et al as applied to claims 1 and 14 above respectively, 
and further in view of US Patent No 6,493,721 to Getchius et al (hereafter 
Getchius). 

Referring to claim 7, Warthen/Kapur/Malpani disclose a compression operation, 
however, Warthen/Kapur/Malpani fail to explicitly disclose the further limitation wherein 
the compression operation is subsumption. Getchius discloses optimizing search 
queries (see abstract), including the further limitation of subsumption (see column 24, 
lines 7-22). 

It would have been obvious to one of ordinary skill in the art at the time of the 
invention to utilize the concept of subsumption as disclosed by Getchius as the type of 
compression performed by Warthen/Kapur/Malpani. One would have been motivated to 
do so in order to increase the accuracy of determining if two strings are duplicates. 

Referring to claim 8, Warthen/Kapur/Malpani/Getchius discloses the method of 
claim 7, wherein subsumption includes applying an impossibility condition to selectively 
compute edit distance [edit distance d] (Kapur: see [0048], lines 13-31). 
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Referring to claim 19, the claim is rejected on the same grounds as claim 7. 

Referring to claim 20, Warthen/Kapur/Malpani/Getchius discloses the system of 
claim 19, wherein subsumption includes applying an impossibility condition to 
selectively compute edit distance [edit distance d] (Kapur: see [0048], lines 13-31). 

Response to Arguments 

8. Applicant's arguments filed on page 6 of the remarks have been fully considered 
but they are not persuasive. The applicant states that "while Warthen may provide a log 
of linguistic help query strings (questions) such questions are not relative to a help 
function of a computer system." It is noted that the applicant failed to point out how 
specifically Warthen's questions differ from those of applicant. Thus, the examiner 
respectfully disagrees. Warthen discloses that "the user asks questions of an 
information server" at column 2, lines 48-50 and that "information server 50 might be 
built into a kiosk for providing answers to kiosk users' questions, or information server 
50 might be connected to a corporate network for answering customer, supplier and/or 
employee questions" at column 3, lines 57-67. The examiner fails to see how the 
purpose of the question differs and how the type of question would require a different 
response or steps to be performed than what is accomplished by Warthen. 

9. Applicant's arguments with respect to the argument on pages 6-7 of the Remarks 
that Kapur fails to provide the step of re-compressing the compressed log have been 
considered but are moot in view of the new ground(s) of rejection. 
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Conclusion 

1 0. Applicant's amendment necessitated the new ground(s) of rejection presented in 
this Office action. Accordingly, THIS ACTION IS MADE FINAL. See MPEP 
§ 706.07(a). Applicant is reminded of the extension of time policy as set forth in 37 
CFR 1.136(a). 

A shortened statutory period for reply to this final action is set to expire THREE 
MONTHS from the mailing date of this action. In the event a first reply is filed within 
TWO MONTHS of the mailing date of this final action and the advisory action is not 
mailed until after the end of the THREE-MONTH shortened statutory period, then the 
shortened statutory period will expire on the date the advisory action is mailed, and any 
extension fee pursuant to 37 CFR 1 .136(a) will be calculated from the mailing date of 
the advisory action. In no event, however, will the statutory period for reply expire later 
than SIX MONTHS from the date of this final action. 
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Contact Information 

Any inquiry concerning this communication or earlier communications from the 
examiner should be directed to KIMBERLY LOVEL whose telephone number is 
(571)272-2750. The examiner can normally be reached on 8:00 - 4:00. 

If attempts to reach the examiner by telephone are unsuccessful, the examiner's 
supervisor, John Cottingham can be reached on (571) 272-7079. The fax phone 
number for the organization where this application or proceeding is assigned is 571- 
273-8300. 

Information regarding the status of an application may be obtained from the 
Patent Application Information Retrieval (PAIR) system. Status information for 
published applications may be obtained from either Private PAIR or Public PAIR. 
Status information for unpublished applications is available through Private PAIR only. 
For more information about the PAIR system, see http://pair-direct.uspto.gov. Should 
you have questions on access to the Private PAIR system, contact the Electronic 
Business Center (EBC) at 866-217-9197 (toll-free). If you would like assistance from a 
USPTO Customer Service Representative or access to the automated information 
system, call 800-786-9199 (IN USA OR CANADA) or 571-272-1000. 

/John R. Cottingham/ Kimberly Lovel 

Supervisory Patent Examiner, Art Unit 2167 Examiner 

Art Unit 2167 

15 August 2008 
kml 
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